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Percolation theory has been largely used in the study of structural properties of complex networks 
such as the robustness, with remarkable results. Nevertheless, a purely topological description is 
not sufficient for a correct characterization of networks behaviour in relation with physical flows 
and spreading phenomena taking place on them. The functionality of real networks also depends on 
the ability of the nodes and the edges in bearing and handling loads of flows, energy, information 
D ' and other physical quantities. We propose to study these properties introducing a process of inho- 

00 , mogeneous percolation, in which both the nodes and the edges spread out the flows with a given 

probability. 

Generating functions approach is exploited in order to get a generalization of the Molloy-Reed 
Criterion [M. Molloy and B. Reed, Random Struct, and Alg. 6, 161-180 (1995)] for inhomogeneous 
joint site bond percolation in correlated random graphs. A series of simple assumptions allows the 
O ' analysis of more realistic situations, for which a number of new results are presented. In particular, 

for the site percolation with inhomogeneous edge transmission, we obtain the explicit expressions of 
the percolation threshold for many interesting cases, that are analyzed by means of simple examples 
and numerical simulations. Some possible applications are debated. 
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INTRODUCTION 



\ First introduced in the study of gelation processes in the 1940s PIQj percolation theory has been largely used in 
statistical mechanics in order to describe topological phase transitions in lattice systems, such as the emergence of 
global properties when some physical parameter (temperature, concentration, etc.) exceeds a critical value [H H E| • 
1 ^ 1 . More precisely, a site percolation process can be sketched as follows. Each site of the lattice is occupied with a uniform 
probability q, called occupation probability, and empty with probability 1 — q. A cluster is a connected set of occupied 
y—i . sites. When the value q of the occupation probability is larger than a critical value q c , the percolation threshold, a 
cluster with size of the same order of the system, i.e. infinite in the thermodynamic limit, appears. Analogously, 
the process can be defined on the bonds joining neighbouring sites, with the only difference that the site occupation 
probability q is replaced by a bond occupation probability. The bond percolation corresponds to the site percolation 
\ on the conjugate lattice (obtained exchanging bonds with sites and viceversa), and gives similar results. 

Recently, percolation theory has been successfully introduced in the field of complex networks, with some seminal 
' works on undirected uncorrelated random graphs El E El , IToj ■ A number of other works followed, studying the cases 
of correlated 0,0,0,0] and directed p5l llrill7lllSj] random graphs. 

The simplest example of percolation on graphs is the topological phase transition occurring in the generation of 
an Erdos-Renyi random graph |l9l l20|. Indeed, the static construction algorithm proposed by Erdos and Renyi for 
a random graph with N vertices requires no embedding space and, for N — > 00, corresponds to the mean-field bond 
percolation on an infinite lattice Q. In this case, the bond percolation threshold is the value of edge (bond) occupation 
probability, at which an infinite cluster appears, namely, in the language of graph theory, a giant component emerges. 
The celebrated Molloy-Reed criterion for percolation |2lJ,|22j shows that, for this model, a giant component exists if 
and only if the edge occupation probability is larger than 1/N. Hence, even simple examples like this one suggest 
that percolation is directly related to the structural properties of graphs. 
• i-h , Iii addition, the correct functioning of real complex networks largely relies on their structural properties: the 
resilience to dama ge a nd the robustness to external attacks, cascade failures, or to collapses due to traffic jams and 
overloadings 1M H HI El IH • A common way to evaluate the robustness of a graph is that of studying the 
size Qf of the giant component after the removal of a fraction / of vertices. Due to nodes removal, Qf is expected 
to decrease, but the remarkable point is that a random removal of nodes (also called random failure process) can be 
mapped exactly on a site percolation problem, with / = 1 — q. Percolation theory predicts the presence of a threshold 
value f c = 1 — q c above that the giant component disappears, i.e. the network is fragmented in many disconnected 
components of very small sizes (~ (log TV)) with respect to the total number N of vertices in the graph. As for 
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lattices, also for general random networks site percolation and bond percolation (or nodes and edge percolation, using 
network's language) are complementary notions. 

Percolation has found applications also in the context of spreading phenomena on networks, such as epidemic 
models l28l I29I I30L l3l| : for some of them, the connection with percolation is a longtime topic (see for example 
|32l I33L f34L l35 | l. while for other models, more rigorous descriptions, based on the analysis of population equations of 
epidemiology, are required j2EE3,EEHSE3j0,E3)E3j0|- The models of epidemic spreading solved using percolation 
theory identify vertices with individuals and edges as pairwise contacts between them 30] . A way to distinguish 
disease-causing contacts from safe ones, is that of introducing a parameter, called transmissibility, accounting for the 
disease's transmission probability along the edges. Without any claim of characterizing all real properties related to 
transmissibility, the model's behaviour is easily determined using edge percolation, that establishes the presence of a 
giant component as function of the transmission probability, and states the possibility for global epidemic outbreaks. 
In this context, the existence of a giant component means that we can reach an large (infinite) number of other nodes 
of the same graph starting by one of them and moving along the edges. 

Ncrvcrthelcss, the edge percolation cannot encode all the features of a spreading process. Real spreading phenomena 
are more complex, they need time and resources to traverse the network and the spreading rates themselves depend 
on the properties of the nodes and the edges and on the details of the process. Many examples come from daily 
life: in the Internet, connections between computers are real cables, thus the transmission along them depends on 
their bandwidth; in air-transportation networks, the edges are weighted with the number of available seats, that is a 
measure of the their capacity; in social and contact networks as well, the importance of links is related to the type of 
the relationships between individuals. 

The aim of this paper is that of developing a more general approach to spreading phenomena using percolation, 
endowing the nodes and the edges with some properties that quantify their ability to spread flows throughout the 
network. 

The idea of a percolation process in which the basic elements (nodes and edges) can present inhomogeneous prop- 
erties goes back to the physical studies on the conduction in resistor networks models (see Ref. |6| and references 
therein). A typical way to introduce disorder on the edges (or on the nodes) is that of assigning to each of them 
a random number between and 1, then removing all edges (nodes) in a selected interval of values. Percolation 
properties are studied as functions of the fraction of edges (nodes) that remains. Powerful methods, as that of the ef- 
fective medium theory f |45l l46fl ) have provided a general comprehension of the large disorder regime of such processes. 
Mathematicians have studied similar types of site and bond percolation models using the theory of random fields, but 
only partial or isolated results have been achieved (see Refs. or Refs. 0)H3,|^| for a general introduction). 

Using the well-known approach of generating functions we are able to study a class of percolation problems on 
random graphs that are related to the original random resistor network model and that we will call inhomogeneous 
percolation models on random graphs. In particular, we will study these models in relation with spreading phenomena. 

In order to express the ability of an edge to transfer some physical quantities (information, energy, diseases, 
etc.), we introduce an edge transition probability Tij, generally dependent on the properties of the vertices i, j and 
of the edge (i, j) itself. In addition, in real processes the nodes have different ability to spread, thus they should 
be supplied with a node traversing probability q^. In this manner, the actual size of the giant component can be 
affected by the non-optimal or non-homogeneous flow through nodes and edges, the most general model performing 
an inhomogeneous joint site-bond percolation. On the other hand, information about simpler processes such as site 
percolation with inhomogeneous bonds (edges) or bond percolation with inhomogeneous sites (nodes) can be easily 
obtained assuming, in turn, qi or T{j as uniform. In these particular cases we will specify them as node (q) or edge 
(T) occupation probability. 

In relation with the modern theory of complex networks |5^ . l53l l54l l55| , this inhomogeneous percolation model 
can also provide further information about the robustness of random and real networks. It allows to study the 
behaviour of the functional giant component, that can be defined as the part of the giant component that can bear 
and handle a large flow of information, ensuring the global well-functioning of the network. Edges with small or large 
transmission abilities are topologically identical, but spreading processes can be deeply influenced by the existence of 
fragile connections collapsing under the load of large flows. 

The present paper is organized as follows. In Section [n] we propose an intuitive way of modeling a simple problem 
of random resistor networks as a site percolation with noisy edges, leading to approximated but interesting results. A 
rigorous approach to the general problem of inhomogeneous joint site-bond percolation, by means of the generating 
functions formalism, is developed in Section UTTl It contains a deep analysis of the case of correlated random graphs 
starting from very general assumptions. The main result obtained by this technique is a generalization of the Molloy- 
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Reed criterion for the existence of the giant component |2jJ, |22J. With further assumptions, one easily recovers 
well-known results on the threshold value for site percolation in uncorrelated random graphs, with some interesting 
intermediate situations. Section IIVI is devoted to applications: we derive analytically the expression of the critical 
threshold of site percolation for different functional forms of the transition probability, testing some of these results 
by numerical simulations. Conclusions on the relevance of this approach are exposed in Section [V] Some technical 
details on the derivation of formulae presented in Section IlIII arc reported in the Appendices. 



II. INTUITIVE ARGUMENTS FOR SITE-PERCOLATION IN UNCORRELATED RANDOM GRAPHS 

WITH NOISY EDGES 

In this section, we present an introductory example that can be related to the study of random resistor networks 
in which each edge is given a random conductance value between and 1. The probability of traversing 

an edge is a function of this conductance. In resistor networks it is usually treated as a step function (the Heaviside 
0(Tij > To) for a reasonable value To), so that transmission occurs only along a fraction of the edges (with sufficiently 
high conductance) and, for those edges, it is optimal. Here, we are interested in spreading phenomena, in which the 
probability of passing through an edge should rather be a linear function of {Tj}; for this reason, we consider directly 
the set of {Tj} as the set of edge transition probabilities. A simple method allows to study this model as a site 
percolation with noisy edges (the name comes from mathematical research in the field of wireless communication |56| ) 
and to describe how the presence of edge transition probabilities affects general structural properties of networks. 

We use an intuitive derivation of the percolation criterion that follows directly from the approach introduced by 
Cohen et al. in Ref. [9| for uncorrelated sparse random graphs. Let us call p^ the probability that an edge in the 
network does not lead to a vertex connected via the remaining edges to the giant component (infinite cluster) and Tj 
the probability that a flow leaving the node i passes trough the edge («, j), reaching the node j. Both pij and Tj are 
defined in the interval [0, 1]. 

Note that the definition of transition probabilities resembles the one originally introduced by Newman in Ref. |30tl3l| 
with the name of transmission probability (or transmissibility) to study epidemic spreading. For this reason, in the 
following, we will sometimes refer to the transition probability as the transmissibility of an edge. 

In general, pij depends on the transition probability Tj of the edge (i, j), by a factor 1 — Tj, and on the probability 
that the node j, if reached, does not belong to the infinite cluster. This contribution can be computed as the joint 
probability that each one of the remaining edges emerging from j does not belong to the giant component. The 
different contributions are assembled as in Fig. ^A, giving the recursive expression 

Pij = 1 - T ij + T ij II Pjh , (1) 

h e v(j) 

h # i 

that contains a product of kj — 1 terms pjh if kj is the degree of the node j and V(j) is its set of neighbours. This 
recursive procedure is not closed, but the introduction of some hypotheses on the set of transition probabilities {T} 
allows a statistical reformulation of Eq. ^ Let us suppose that the transition probabilities are random variables with 
a given ditribution px, and the average overall probability p that a randomly chosen edge does not belong to the giant 
component depends only on some general properties of that distribution such as the mean, the variance, etc. (i.e. 
p = p({T})). Moreover, by picking up an edge at random in a random graph, the probability that it is connected to 
an edge of degree k is ^ k -#y-, where pu is the degree distribution of the graph. We can now write a self-consistent 
equation for p({T}), 

p({T}) = l-T + T^2 ^PttT})*- 1 , (2) 

in which T is a realization of the independent and identically distributed (i.i.d.) random variables {T} in the interval 
[0, 1]. Averaging both the members of the Eq.|21on pt, we obtain that the probability p depends only on the average 
value (T), i.e. p({T}) = p((T)), and the equation becomes 

p((T)) = 1 - (T) + (T) lyKCO)^ 1 - I[p((T))} . (3) 

The equation is graphically represented in Fig.^-B. Apart from the trivial solution p = 1, another solution p = p* < 1 
exists if and only if ^| P =i > 1- The curve I[p], indeed, is positive in p — (/[0] = 1 — (T) + (T)pi/ (k) > 0), therefore 
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FIG. 1: A) Graphical representation of the Eq. Fat striped lines indicate the contribution coming from the unknown 
probability pij that an edge does not belong to the infinite cluster. The first term on the right-hand side represents the 

contribution coming from the probability 1 — Ty that a flow does not traverse the edge the second term accounts for the 

probability that the edge is traversed times the unknown contribution pjh of other edges attached to j. 
B) Graphical representation of the self-consistent Eq. |5] Fat striped lines correspond to the unknown uniform probability 
p({T}) that an edge does not lead to the giant component. Full lines mean that edges can be crossed, with probability T, while 
the dashed edge corresponds to the contribution 1 — T that an edge cannot be crossed. 



3^Ip=i ^ 1 means that it crosses the line f(p) — p in a point < p* < 1. The condition on the derivative of the r.h.s. 
of Eq. corresponds to 

CO { > {k) K 1 > 1 • (4) 

A generalization of the Molloy-Reed criterion [2lJ for the existence of a giant component in presence of noisy edges 
immediately follows, 

meaning that, when the transition probabilities are i.i.d. random variables, a giant component exists if and only if 
the inequality is satisfied. The case of uniform transition probabilities is exactly the same, with uniform value T 
instead of the average value (T). Note that, while in the case of perfect transmission (T^ = 1) the usual formulation 
of the Molloy-Reed criterion is recovered [2l|. when (T) < 1 the r.h.s. of Eq. [SJcan grow considerably, affecting the 
possibility of observing percolation: the smaller is the average transition probability, and the larger are connectivity 
fluctuations (k 2 ) needed to ensure the presence of a giant component. 

For a random graph with poissonian degree distribution (i.e. pk = (k) /k\), the criterion in Eq.0 corresponds 
exactly to have (k) > 1/(T). We show in Fig. [3 A the results of numerical simulations of the giant component's 
computation on an Erdos-Renyi random graph with = 10 4 nodes and with random transition probabilities uniformly 
distributed in [a, b] with < a < b < 1. A giant component clearly appears when the mean connectivity (k) exceeds 
the inverse of mean transmissibility value (6 — a)/2. Considering different distributions (e.g. binomial distributions) 
for the i.i.d. random variable T does not affect the results. 

For heterogeneous graphs with broad degree distribution, the inequality in Eq. [S] is satisfied thanks to the huge 
fluctuations of the node degrees ensuring the l.h.s. to be larger than I + 1/(T). In particular, when the graph has 
power-law degree distribution pk ~ mk" 1 (2 < 7 < 3, m is the minimum degree), fluctuations diverge in the limit 
N — ► 00. In this case, the giant component always exists if A^ is large enough. If 7 > 3, the second moment is finite, 
and Eq. [5] provides the following bound for the average transition probability necessary to have a giant component 
(computed using the continuum approximation for the degree), 

7-3 

~ 7 (m- 1) + 3-2to ' ^ 
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FIG. 2: A) Size of the giant component Q c vs. average degree (k) in an Erdos-Renyi random graph in which the edge transition 
probabilities are uniformly random distributed with mean value (T). Simulations are performed on a sample of 100 graphs 
with N — 10 4 nodes. The different curves refer to different values of average transition probability: (T) = 1 (circles), 0.75 
(squares), 0.5 (up triangles) and 0.25 (down triangles). The inset reports the numerical values (circles) of (k) at which the 
giant component appears for different (T) along with the values predicted by the theory (crosses). 

B) Size of the giant component Q c vs. the exponent 7 in an power-law graph in which the edge transition probabilities are 
uniformly random distributed with mean value (T). Simulations are performed on a sample of 100 graphs with TV = 10 4 nodes, 
cut-off K = 10 2 and minimum degree m = 1. The different curves refer to different values of average transition probability: 
(T) = 1 (circles), 0.75 (squares), 0.5 (up triangles) and 0.25 (down triangles). 
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The inequality is always satisfied for m > 1 + 1/(T), while for m < 1 + 1/ (T) it is satisfied when 7 < 3t i_( m _i)(y) 
m = 1, an infinite (uncorrelated) scale- free graph presents a giant component of order O(N) only when 7 < 3+(T) _ 
At 7 = 3, logarithmic corrections should be taken into account |9(. 

Since real networks are large but finite, it is important to compute the finite-size effects for power-law graphs with 
exponent 2 < 7 < 3, introducing a cut-off n(N) on the degree. Following Ref. Q, the expression for (k 2 )/{k) can 
be easily computed in the continuum approximation for the degree, obtaining an explicit relation for the bound on 7 
and (T), 
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The r.h.s. has a maximum at 7 = 2 + l/log(/«), in which it assumes the value 

lOg(fv) - 1 



777 K 



log( K ) _|_ 1 _ log(K) 



(7) 



(8) 



For 777 = 1, this expression reaches a maximum value ~ 0.6 for k ~ 10, then it decreases monotonously for larger 
values of the cut-off. For k ~ 10 2 , it reduces to 0.1, showing that the existence of a giant component can be ensured 
even for small (T) if the cut-off is sufficiently large. However, in Fig. I2J-B we have reported the size of the giant 
component for power-law random graphs as a function of the exponent 7 for different values of the average edge 
transition probability (T). The simulations are performed on graphs of = 10 4 nodes with cut-off k — 10 2 . They 
clearly show that in finite power-law networks, the real size of the giant component is dramatically affected by low 
transition probabilities. 

Note that inverting this generalization of Molloy-Reed criterion (Eq. |SJ) , and looking at the critical value of average 
transition probability (T) that is sufficient for the graph to admit a giant component (as we have done in Eq. 0) 
gives results completely equivalent to those obtained by standard (homogeneous) bond percolation with (T) equal to 
the uniform edge occupation probability T (see, for example, Ref. 

Finally, the argument of Cohen et al. can be used to compute the threshold of site percolation with noisy edges. 
If q is the (uniform) node occupation probability in an uncorrelated random graph, then a randomly chosen node, 
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whose natural degree in case of q = 1 is k', will assume a degree k < k' with probability 

q k (l-qf- k , (9) 



V k / 

that means that the corresponding degree distribution P q (k) is related to the degree distribution P q= i(k') by 

P q (k) = £ P q=1 (k') ( I ) q\l - qf- k ■ (10) 



k'=k 



It follows that (k) q — q(k) q=1 and (k 2 ) q = q(k 2 ) q=1 + q(l — q)(k) q=1 , that introduced into the expression Eq. |5]for 
the Molloy-Reed criterion gives the expression of the threshold value for site percolation with noisy edges, 

* (11) 

Eg. II II shows that decreasing the average transmission capacity of the edges (i.e. edge transition probabilities) causes 
a rise of the value of node occupation probability necessary to ensure percolation. 

The intuitive method we have proposed in this section is correct for uncorrelated graphs where the edges are 
supplied with uniform or homogeneously randomly distributed transition probabilites. The next section is devoted to 
the formalism of generating functions that is appropriate for deriving more general results for the case of two-point 
correlated random graphs. 



III. GENERAL THEORY OF INHOMOGENEOUS PERCOLATION IN CORRELATED RANDOM 

GRAPHS 

Percolation theory on graphs has been formally developed by means of generating functions E3, , a method 
that can be successfully applied also in the present case of inhomogencous joint site-bond percolation. Since in 
general we will consider edge transition probabilities depending on the properties of the two extremities of the edges, 
the correlations between neighbouring nodes should be taken into account. More precisely, a very general type 
of generating functions is introduced in order to describe the properties of random graphs with vertex-vertex pair 
correlations (also called Markovian networks ^(j) an d with heterogeneously distributed transition probabilities on the 
edges. Our main result is a general version of the Molloy-Reed criterion for the existence of a giant component in the 
case of joint site-bond percolation and, consequently, the expressions of the critical threshold for the two separate cases 
of site percolation and bond percolation. We conclude the section discussing some examples in which the functional 
form of the transition probabilities allows explicit calculations of the percolation threshold. Our notation is the same 
as in recent expositions (see Refs. 0,113), nevertheless we refer to Appendix-lAl for a brief introduction to generating 
functions in random graphs. 



A. Generating functions approach for Markovian networks 



In graph theory, two-points correlations are usually expressed as functions of the degree by the degree conditional 
probability p(k'\k) that is the probability that a vertex of degree k is connected to a vertex of degree k' . This has 
led to the definition of a class of correlated networks, called undirected Markovian random networks |4fj| and defined 
only by their degree distribution p k and by the degree conditional probability p(k'\k). The function p(k'\k) satisfies 
the normalization constraint 

5>(*'l*) = 1 , (12) 

ft' 

and a detailed balance condition 143 



kp{k'\k)p k = k'p(k\k')p k , 



(13) 
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In uncorrelated graphs, p(k'\k) does not depend on k and it can be easily obtained inserting Eq. ^]into Eg. 1131 



p(k'\k) 



k'pk' 
(k) 



(14) 



In this context, we can assume that also the edge transition probability depends only on the degrees ki and 
kj of the extremities. Note that, while the analysis of the standard site (bond) percolation is based on the relation 
between the degree distribution pk and the degree-dependent node occupation probability q k , in the inhomogeneous 
joint site-bond percolation the relation is between the pair of distributions {pk,p(k'\k)} and the pair of probability 
functions {q k ,T kk ,}. 

A simple approach to inhomogeneous percolation is that of considering an approximated expression for the proba- 
bility that an edge emerging from a vertex of degree ki is traversed by the flow. By averaging over the contributions 
of all the second extremities of the edges emerging from a node, we obtain a degree-dependent effective average 
transmission coefficient r ki — ^2 k . T kikj p(kj\ki) (0 < r ki < 1). The advantage of this method is that of providing 
analytically solvable equations for the percolation condition. Moreover, even if the sum on kj introduces an approx- 
imation, the effective term r ki does not neglect the contributions due to the edge transition probabilities, weighting 
them with the correct degree conditional probability as required for Markovian networks. If these contributions are 
similar the approximation is very good, while when the edge transition probabilities or the nodes correlations are 
highly heterogeneous a different analysis should be applied (see Section lill B|l . 

According to this approximation, the generating function Fq(x; {q,T}) of the probability that a spreading process 
emerging from an occupied node flows through exactly m nodes (whatever their degrees are) is written as 



F (x;{q,T}) = 



PkiQk, 

ki=l 



fc,=i 



(15) 



This expression can be derived using simple arguments. The probability that exactly m of the ki edges rooted in i 
are connected to vertices of degree kj and that they are open (i.e. they let the flow pass along them) is 



h ) [r fcifcj p(fc J |fc i )] m [i-r fcifcj .p(%|fc l )] 

m / 



ki —m 



(16) 



If we average over the contributions of all possible second extremities, T kikj p(kj\ki) is replaced by J2 kj T kikj p(kj\ki); 
then putting together these terms with an x variable for each open edge, summing over all the possible degree ki and 
over all the possible values of m, we get 



oo oo 



F (x;{q,T})=J2 E 



Pkiq kl 



m=0 ki =m+l 
oo ki 

ki = l m=0 



ki = l 



( lk, 




1 + (x - 1) ^2 T kikj p{kj\ki 



fe,=i 



1 - }T kiki p{kj\ki 
1 - ^ Tk^pjkjlkj) 



ki —m 



ki —m 



(17) 



The value Fo(l; {q, T}) assumed by this function in x = 1 is the average occupation probability (q) = ^2 k Pkqk, that is 
consistent with the fact that summing over the contributions of all the possible amounts of emerging edges traversed 
by the flow means that we are simply considering the number of starting nodes, i.e. the average number of occupied 
nodes. The first derivative with respect to x computed in x = 1 



^F (x;{q,T}) 



= 2J hPkilki 2J TkikjPikjlki) 

x=1 ki=l 



(18) 



is the average number of open edges emerging from an occupied vertex. 
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Using similar arguments, we get the generating function Fi(x; {q, T}) of the probability that the flow spreading from 
a vertex, reached as an extremity of an edge picked up at random, passes through a given number of the remaining 
edges, 



Fi(x;{ 



kpk 



Qki 



fc,-=i 



ki-l 



(19) 



Following the notation presented in Appendix-lAl we call Hq(x) {q,T}) the generating function of the probability 
that a randomly chosen vertex belongs to a cluster of given size and H\(x\ {q, T}) the generating function of the 
probability that a randomly chosen edge leads to a vertex belonging to a cluster of a given size. They satisfy a system 
of self-consistent equations 



H (x; {q, T}) = 1 — F (l; {q, T}) + xF [H^x; {q, T}); {q, T}} 



(20a) 



H x {x; {q, T}) = 1 — {<?, T}) + ^ [H^x; {q, T}); {<?, T}] 
The expression for the mean cluster size (s) is obtained by derivation of Eq. I20al 



(s) = — H (x;{q,T}) 
ax 



= H^l;{q,T}) 



(20b) 



(21) 



where x — 1 implies the average over all possible degrees fcj. From Eg. I20al20bl using the relation H\(\; {q,T}) = 1, 
we get 



(s) = F (l;{q,T}) + F>(l;{q,T})Hi(l;{q,T}) , 
where the expression for -ff[(l; T 1 }) comes directly from deriving Eg. I20bl with respect to x in x = 1, 

H>(VS„ TW- Fl ^'fa T » 
ffl(1 ' {? ' T}) -l-F{(l;{ g) T}) • 

Inserted into Eq. [23 it leads to the well-known expression (Q) 

*o(i;{«,r}) 



( s ) = Fo(l;{ 9 ,T}) 



l-^(l;{g,T}) ' 



(22) 



(23) 



(24) 



hence, the giant component exists if and only if F{(1; {q, T}) > 1. Explicitly, it means that the generalized Molloy- 
Reed criterium for the inhomogeneous joint site-bond percolation on Markovian random graphs takes the following 
form, 



^ VkM [(h - Vjqk.T^k, ~ l] p(kj\ki) > 



(25) 



The threshold for the site percolation with inhomogeneous edges occurs when the equality holds. Inserting uniform 
occupation probability qi~ = q, its critical value is 



(k) 



Efe°=i ki{ki - l)p kl E^° = i T kiki p{kj\h) 



(26) 



However, in many natural, social and technological networks, the nodes represent individuals or organizations with 
very heterogeneous properties that are completely neglected looking just at a degree-level. For instance, in the Internet 
there are some non-physical relationships between the nodes (Autonomous Systems) due to policies or administrative 
strategies that have a deep influence on the spreading of information. Other examples come from social networks, where 
rather homogeneous intra-community environments contrast with very heterogeneous properties exhibited switching to 
other groups of individuals. These extra-factors cannot be easily incorporated into a statistical analysis and represent 
a challenge for the present research in complex networks. 

A natural attempt to include these properties in the formalism of degree-dependent generating functions consists of 
admitting the presence of different types of nodes, as assumed by Newman in Ref. [l2| . In the next section we derive 
some general results in the case of networks with multi-state nodes. 
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Generalization to networks with multi-state nodes 



Suppose that the nodes of a graph are divided into n classes, each one specified by a particular quality or state. 
Then, we consider the degree distribution p[ °f nodes of class h = 1, . . . , n, conventionally normalized on the relative 

set of nodes, i.e. X^-Pfc^ — 1- This condition ensures the normalization to 1 for the generating functions. Inside the 
classes there are no restrictions on the transition probabilities and they might be very different. 

Without entering the details of the derivation, that are reported in Appendix-lBl we consider the general case of 
inhomogeneous joint site-bond percolation in multi-state correlated random graphs. The generating function of the 
probability that a physical quantity, spreading from a vertex of class h, flows through a certain number of edges and 
reaches vertices in the same or in different classes is 



.,x n ;{q,T}) = F^ h \^,{q,T}) 



E(h) (h) 

fci=i 



p 



1=1 



k, 



(27) 



where is the occupation probability of a node of class h and degree fcj , T k h k ~* 1 ^ is the transition probability from a 
node of class h and degree fcj to a node belonging to the class I and having degree kj, , and p( h ^ l > (kj t \ki) is the degree 
conditional probability in multi-state markovian graphs. Similarly we compute the generating function F^ h \x; {q, T}) 
of the probability that a randomly chosen edge leads to a vertex of degree fcj, belonging to the class h, and from which 
the spread flows towards a given number of neighbours (of the same or different classes). It reads 



F[ h) (x 1>X2 ,...,x n] {q,T}) = F[ h) (x; {q, T}) = £ 



tot? 



t =i E fe k P. 



(h^l) (h^l) 



1=1 



fci-1 



(28) 

The two above quantities allow to write a system of self-consistent equations for the generating function (x; {q, T}) 
and H[ h \x; {q, T}) of the probability that a randomly chosen vertex of class h or an edge leading to that node belong 
to a connected component of a given size, 



H[ h) (x; {q, T}) = 1 — F^> (1; {q, T}) + x F^> [H\ k > (x; {q, T}), . . . , H?> (x; {<?, T}); {g, T}] 



(29a) 



4 h \x; {q, T}) = 1 — F^ h) (1; {q, T}) + x F^ [h[ 1] (x; {q, T}), (x; {q, T}); {q, T}} 



(29b) 



The condition determining the existence of a giant component is again the divergence of the the mean cluster size, 
and corresponds to the following generalized Molloy-Reed criterion for inhomogeneous joint site-bond percolation in 
multi-state Markovian correlated random graphs (see Appcndix-ITjl for details): 



det [/ - F] < 



(30) 



where T = V x Fi[x = 1; {q, T}] (whose elements are of the type ^-F^[l; {q,T}]). 

It is evident that such a general result strongly depends on which kind of node partition we are considering. Two 
examples of partitions are particularly relevant: a single class collecting all the nodes and a degree-based classification 
of the nodes. 

In the latter situation, each class gathers all vertices with a given degree and there is in principle an infinite number 
n of states (in finite networks there are as many states as the number of different degrees). Since in this case p\^^ = 1 



and 



ktPl 



1, Eas. 12712^1 become 



Fo kz \x 1 ,...,x n ;{q,T}) = q ki 
Fj; ki \x 1 ,...,x n ;{q,T}) = q ki 



1 + E( Xfc j ~ ^)Fkik 3 p(kj\ki) 



1 + /A x ki - tyTktkjPikjlki 



fei-i 



(31a) 
(31b) 
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The self-consistent system of equations for the generating functions of clusters size probability looks like in Eas. l29al - 
I29bl and the condition for the existence of a giant component is still the divergence of the mean cluster size, but now 
the elements of the matrix T are 

Ta = (V B Fi[x = 1; {g,T}]) y = {h - l)q k T ktk] p(k \h) . (32) 

The generalized Molloy-Reed criterion becomes 

det [(ki - ^qkiTk^pikjlh) - <%] > . (33) 

This expression corresponds to the criterion of Ref. ^| f° r the existence of percolation in correlated random graphs 
(apart from a matrix transposition), but with the difference that in this case we are dealing with inhomogeneous joint 
site-bond percolation, then both the degree-dependent node traversing probability and edge transition probability 
appear in the expression. The condition that percolation threshold is related to the largest eigenvalue (see Ref. [l3|]) 
is recovered if we assume that all nodes have equal traversing probability q ki = q = const. In other words, we are 
switching from a joint site-bond percolation to a simple site percolation with an occupation probability q. If q 
(otherwise the percolation condition cannot be satisfied), we can write the condition as 

q det [(ki - l)T kikj p(kj\ki) - A%] > , (34) 

with A = l/q. Since in q = the determinant is negative, the smallest positive value of q ensuring Eg. 1341 to be 
satisfied corresponds to the largest eigenvalue A max of the matrix (ki — l)T kik .p(kj\ki). It follows that the critical 
value of site occupation probability is q — l/A max . In the case T kikj = 1, the condition gives exactly the results by 
Vazquez et al. jl3j . 

However, the complete knowledge of the correlation matrix p(kj\ki) is very unlikely for real networks, and the 
analytical solution of Eq. [231 can be problematic also for simple artificial networks. In the other case, in which all 
nodes belong to a unique class, the single-state network is recovered and the analytical treatment coincides with that 
presented in the previous section. In the rest of the paper, all the presented computations and simulation results will 
be referred to this particular type of inhomogeneous joint site-bond percolation on single-state markovian networks 
or the corresponding explicit formulae for site or bond percolation. Note that, starting from Eg. 1251 with few further 
assumptions, other simpler versions of Molloy-Reed criterion are possible, some of which will be computed in the next 
section. 



C. Transition probability factorization, single- vertex dependence and uniform probability. 

The generalized Molloy-Reed criterion holds for two-point correlated networks (Markovian networks) , in which the 
condition for the existence of a giant component is related to the behaviour of the transition probability function 
T kik ., that in principle depends on edges properties and on the functional performances of the spreading process. 
Henceforth, we will focus on the problem of how different forms of transition probability affect the inhomogeneous 
percolation criterion. For clarity, only the case of all vertices belonging to a single class is considered. 

Although a plethora of behaviours for T kik . could be considered, the most natural assumption consists of their 
factorization in two single- vertex contributions. Let us suppose that the transition probability T kik . can be written 
in the form 

T kzk] = 9*(£; i )9 / (fe J ) , (35) 

where subscripts i and / indicate an initial and a final term respectively, in order to stress the fact that first and 
second vertices of an edge can give different contributions in the inhomogeneous percolation process. 
Inserting Eq. 1351 in Eq. 1251 the condition for the existence of a giant component becomes 

oo 

X) PkM [(h - l)ei(ki)Q f (kj)q ki - l]p(k j \k i ) > . (36) 

In the case of uncorrelated graphs, the conditional probability also factorizes in p(k'\k) = £t' , the generalized 
Molloy-Reed criterion becomes 



oo 

^2 Pi, Pi [(h - l)q kl T kik] - l] > , 



(37) 
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and Eq. |2Ugets simpler, leading to an interesting expression for the site percolation threshold, 

a = <*£ m) 

qc [(Pe 4 (fc))-(fce I (fc))](fce / (fc)} ' 1 > 

As an alternative, it seems interesting to study situations in which the transition probability is a function of only 
one of the two extremities of an edge. The case in which it depends only on final nodes means that 0i(fc) = const. = 1 
and T^kj — and the site percolation threshold takes the form 

{k)2 -9c 0m 777TT7^ , (39) 



yc [{*?)- {k)]{Mf(k)) Hc (ke f (k))> 

where q^ om is the value of the critical occupation probability for the correspondent standard homogeneous site 
percolation. The opposite situation, Qf(k) = const. = 1 and 0j(fc) = T k , leads to the following form for the 
Molloy-Reed criterion 

oo 

J2k[(k-l)q k T k -l]p k >0 . (40) 

k=l 

The inhomogeneous site percolation threshold q c follows directly imposing uniform occupation probability qk = q, 

(k) 

qc (m) (m) ■ (41) 

From Eq. 141)1 also the threshold's value for the inhomogeneous bond percolation is immediately recovered. If we 
suppose uniform transition probabilities Tk = T, indeed, we can explicit T in Eg. 1401 as a function of the set of {qk} 
and compute the value of the threshold as 

Tc = (k 2 q k )-(kq k ) ' (42) 

that represents the case in which percolation on the edges is affected by refractory nodes. Such a situation can have 
applications in the study of real networks, expecially in relation to edge removing procedures for estimating networks 
resilience but, in the rest of this paper, we limit our analysis to the site percolation. 

The last case we consider is that of uniform transition probabilities T^t. = T, with < T < 1. In the limit 
T = 1, the original Molloy-Reed criterion is recovered, otherwise, introducing T^kj = T in Eq. |23 wc find the same 
expression for the threshold of site percolation with noisy edges (as in Eq. ^2 with (T) = T) 

(43) 



(m/(k)-l)T 



IV. APPLICATIONS AND NUMERICAL SIMULATIONS 



In Section [IJ it has been stressed that the inhomogeneous percolation could be exploited to investigate the relation 
between structural and functional properties of networks. For example, compared to standard percolation, the inho- 
mogeneous one provides information on the functional robustness of a graph, that is the feedback of the graph to the 
presence of dynamical processes spreading on it. The topological robustness of a graph is principally studied looking at 
the size of the giant component, assuming that connectivity properties are sufficient to discriminate between "robust" 
and "fragile" graphs. In real graphs, as communication networks, our perception of the robustness is based on the 
observation of flows of information and other physical quantities: a connected component can be considered robust if 
a flow can span throughout it without stopping or causing any failure (e.g. in the transmission of an amount of data 
or energy). Since percolation properties are used for predicting the robustness or the vulnerability of a graph, in the 
following we will often translate percolation results to the more effective language of network's resilience, exploiting 
the equivalence between the percolation threshold and the critical fraction f c of removed vertices. As mentioned in 
Section [J the robustness is computed removing a finite fraction / of vertices (and the edges connected to them) and 
studying the size Qf of the giant component as a function of /. The removal of a node corresponds exactly to a 
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unoccupied node in the percolation model, then if the percolation threshold q c exists, the relation f c = 1 — q c holds 
between it and the threshold value of the fraction of removed nodes above which the graph does not admit a giant 
component. Since the presence of edge transition probabilities affects the value f c and the overall robustness of the 
network, decreasing the size of the giant component, it suggests that inhomogeneous percolation is a good paradigm 
for the study of the functional robustness. 

The functional dependence of the transition probabilities {T} on the vertices and the edges is strongly related to 
the specifics of the system and to the type of processes one is interested to describe. Therefore, in the absence of hints 
from studies of real data about spreading processes on networks, the present section is devoted to the highlight of 
some simple examples of functional forms for the transition probability, and to the illustration of their effects on the 
percolation condition. We limit our analysis to two main classes of random graphs: the homogeneous graphs, in which 
the connectivity distribution is peaked around a characteristic degree value, and the heterogeneous graphs, whose 
degree distribution is broad, with very large fluctuations in the degree values. Typical examples of this second class 
are scale-free random graphs with power-law degree distributions (pk — C(t) ^ 7 )i while for homogeneous graphs 
we will always consider poissonian degree distributions (pk — e _ W(fc) /fc!). We also present results of numerical 
simulations validating our analytical findings. 

The numerical simulations have been performed using efficient algorithms for percolation on graphs. In particular, 
the algorithms based on a depth- first search are those better encoding the idea of a spreading process that starting from 
a node pervades the network. For this reason, we preferred to use this type of algorithm. Moreover, agglomerative 
tree-based algorithms |59( can be correctly used only if edge transition probabilities are symmetric, otherwise we 
cannot be sure that the implicit direction of the spreading is conserved during the agglomeration process of the 
discovered rooted trees. Hence, after having randomly chosen a starting node, we perform a depth-first search and 
compute the size of the connected component the node belongs to. This operation is repeated for all the nodes of 
the network (excluding those already visited): the size of the giant component is recorded. Averaging over many 
realizations of networks, we get a statistical estimation of the size of the giant component for a networks ensemble 
with a given set of properties. 

A. How different forms of transition probability affect percolation properties 

In the following, we analyze the effects that different degree-dependent edge transition probabilities can have on the 
functioning of homogeneous and heterogeneous graphs, i.e. how they affect the functional giant component of these 
graphs. In particular, we take into account examples from three different situations: the case in which the transition 
probability depends on two factorized functions of the degrees at the extremities of the edge, the case of single- vertex 
transition probability monotonically increasing or decreasing with fc and the case of uniform or randomly distributed 
transition probability. Note that, for the sake of simplicity, in the following we always consider uncorrelated networks. 

1. Factorized degree- dependent transition probability 

Let us begin with a transition probability that increases with the degree: since the transition probability takes 
values in the interval [0, 1], we assume it tends to 1 for edges joining highly connected nodes, that is reasonable in 
dynamical processes such as the exchange of information on the Internet. For example, in communication networks, 
the hubs have very powerful and efficient systems to handle and re-direct information to the periphery and their 
mutual connections possess huge bandwidths compared to single user's connections. 

As an example of this situation, we consider Tkk' to be a generic homogenous function in fc and fc' (because the 
edges are undirected) and assume that it depends only on the product fcfc' of the degrees of the edge extremities. This 
is justified by the fact that, in many weighted networks, the transition probability can reasonably be a function of 
edge weights Ty = T(wij), since weights are usually measures of the flow and the traffic on the edges, or estimations 
of edges capacity. In this regard, a recent study by Barrat et al. [6(j on the architecture of weighted networks, 
analyzes the correlations between topological and weighted quantities in the real weighted network of world-wide air- 
transportations, providing an indirect justification for our choice. In fact, it turns out that edge weights are correlated 
to the degrees at their extremities according to a phenomenological law of the type wa y (fcjfcj) , a > (a ~ 0.5 for 
the airports network) . More recently, other papers on weighted graph modeling |6ll l62j have used this relation as a 
reasonable form for the correlation between weights and degrees. 

fcfc' 5 

According to this observation, we define transition probabilities Tkk' — 7i(^r) with S > and cut-off k. By 
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FIG. 3: The two figures show the behaviour of f c = 1 — q c from Ea . 1441 for a power-law random graph with cut-off k = 10 2 
(A) and 10 3 (B). The fraction f c of removed nodes required to destroy the giant component Qj is plotted as a function of the 
exponent 7 of the power-law when the transition probability reads T kk i oc (kk' / k 2 ) , with exponent < 8 < 1. Three values 



of 8 are explored: 8 = 0.1 (circle), 8 = 0.25 (squares) and 8 
homogeneous site percolation (limit of 8 — > 0) 



0.5 (triangles). Dashed lines show the solution for standard 



computing the threshold for site percolation with inhomogeneous edges directly from Eq. 1261 we get 

(fc)V A " (k)\ 2S 

qc(rY, ) T 1 ((k 2 + s k' 1+s )-(k^k' 1+s )) T 1 (fci+*>[(fc2+*> - (&+*)] ' ( ' 

Particularly interesting is the behaviour of q c for power-law graphs with degree distribution p k = £(7) k 1 , that we 
have computed by numerical summation. We also put T% = 1. In Fig. EI we report two sets of numerical curves giving 
f c = l — q c as function of the topological exponent 7 for fixed values of S and of the cut-off n: panel A displays the 
curves for k = 10 2 and 5 = 0.1 (circles), 0.25 (squares), 0.5 (triangles), while those for n — 10 3 at the same values 
of S are shown in panel B. Dashed lines show the behaviour of the threshold value for the standard site percolation 
(5 — > 0). The plots show clearly that the fraction of removed nodes sufficient to disconnect the giant component 
is considerably reduced when such a transition probability is taken into account and that the effect is emphasized 
by the cut-off. The results of simulations on power-law graphs with up to N = 10 6 vertices are reported in Fig. 0] 
Panels A-B show typical experimental curves for the giant component Gf as a function of the removed fraction of 
vertices / for two different exponents 7 = 2.2 (A), and 7 = 2.6 (B), cut-off value k = 10 2 and two different values of 
5 = 0.1, and 6 — 0.25. The curves for 7 = 2.2 give a threshold estimation that is in good agreement with the predicted 
values. In the case of larger exponent, on the contrary, the agreement gets worse with a clear overestimation of the 
percolation threshold. The criterion itself for numerically detecting the threshold value (i.e. a flex point in the shape 
of the curves) becomes somewhat arbitrary as shown by the lin-log plots of the curves in panel C. However, we have 
checked these data on different sizes and plotted the corresponding results in panel D of Fig. 0] The three curves 
represent numerical values of the giant component Gf as a function of the removed fraction of nodes for N = 10 4 
(crosses), A^ = 10 5 (pluses) and N = 10 6 (circles). The curves become steeper and steeper near the threshold as the 
size increases, meaning that only for very large graphs the theoretical values can be reached. Finally, the agreement 
between simulations and theoretical predictions are slightly better if we put a larger cut-off, for example k = lO 3 . 



2. Single-vertex dependent transition probability 

Another interesting case is that of the transition probability depending only on the sender's properties, i.e. only on 
the degree of the initial vertex. A first example is that of a monotonously increasing functional form of the degree, that 
saturates to 1 at large k values: we consider TV, = T ki = &i{k) ~ 1 — exp(— -m-), that can be applied also to infinite 
graphs. The parameter a controls the convergence to 1. For large scale-free networks, however, the mere presence 
of a saturation, rather than its rapidity, is sufficient for the system to behave as in the homogeneous percolation, 
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FIG. 4: A-B) The figures show the behaviour of the giant component Qf as a function of the fraction / of nodes removed for a 
power-law graph with N = 10 6 nodes (averaged over 100 realizations), exponent 7 == 2.2 (A) and 7 = 2.6 (B), and with edge 

transition probabilities Tj^t. = (-t^-) . Two values of S are explored: S — 0.1 (circles) and 5 = 0.25 (crosses). The cut-off is 

10 2 . The arrows point to the threshold's position predicted by Eq. 1441 
C-D) The two figures report data on the numerical accuracy of the analysis. Figure C displays a lin-log scale plot of curves in 
panels A and B, revealing that the determination of the percolation treshold is affected by the bad convergence of the curves, 
in particular for large values of 7. Figure D shows the strong finite-size effects on the data; we consider samples with identical 
parameters (7 = 2.2, 8 = 0.1 and k — 10 2 ) and different graph's size N = 10 4 (crosses), 10 5 (pluses)and 10 6 (circles). The 
arrows indicate the positions of the threshold predicted by Eq. 1441 



showing that graph's heterogeneity ensures a zero percolation threshold. From Eq. 1411 simple calculations lead to the 
expression of the threshold for site percolation with inhomogeneous edges, 

to" 1 ) (45) 



* c C(7 - 2) - C(7 - 1) + Li 7 _i(e-««7)/C(7-i)) - ^^(e^CM/C^- 1 )) ' 

in which Li^(z) is the polylogarithmic function X)fc°=i FT l^j - I n Fig. the behaviours of q c vs. 7 for some values of 
the parameter a are reported. They show that the critical fraction of removed nodes f c is exactly 1 for 2 < 7 < 3. 
Also for 7 > 3, the behaviour is qualitatively the same as for standard homogeneous percolation (and corresponding 
robustness measures) on scale-free networks. The reason is essentially that a transition probability converging to 1 
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FIG. 5: Critical fraction f c of removed nodes required to destroy the giant component Qf as a function of the exponent 7 for 
a power-law graph in which the transition probability function is single-vertex depending with the form Tk ~ 1 — exp(— jjjk)- 
The figure reports the behaviour of the expression f c — 1 — q c with q c computed from Eg. 1451 and correct for an infinite graph. 
The parameter a controls the saturation to 1 for highly connected nodes: the cases a — 0.2 (circles) and a — 1 (squares) are 
shown. For large values of the parameter o, the curves rapidly converge to the limit behaviour of homogeneous percolation 
(limit a — * 00). For a < 1, the curves shift from this limit, but in the range 2 < 7 < 3 diverging fluctuations ensure its 
robustness (and the existence of percolation) . 



for the highly connected nodes does not affect the network's properties if the network is infinite, because there is 
always a considerable fraction of nodes with optimal transmission capability. This condition is not trivially satisfied 
by finite graphs, and the presence of a cut-off in the degree can have a strong influence on the network's functional 
robustness. Two kinds of cut-offs on power-laws have been largely studied in literature: an abrupt truncation of the 
degree distribution at a maximum value k « N 1- 1 , or a natural exponential cut-off, i.e. pk ~ fc _7 e~ fc / K . 

While finite-size effects can be studied only by numerical summation, the expression for a degree distribution with 
exponential cut-off can be solved in a closed form introducing the corresponding pk in Eq. 1411 the critical percolation 
threshold is readily expressed using polylogarithmic functions as 



Z/i 7 _i(e «) 

QC ~ LiT^Pk) - LiT^FT) + Li 7 _ 1 ( e -<(T.«)) - Li 7 _ 2 ( e -<(7.'0) ' 



with ^(7, k) defined by 



1,1 \ 1 aLiJe ~) 

*(7, k) = - + - . 7 \ ' . (47) 



K 



Lz 7 _i(e «) 



The introduction of a cut-off actually reduces the effects of degree fluctuations, producing finite percolation threshold 
also in the range 2 < 7 < 3. The two methods of fixing the cut-off give qualitatively similar results about the decreasing 
behaviour of robustness (data not shown), even if natural exponential cut-offs produce slightly lower values. However, 
inhomogeneous percolation does not seem to enrich the scenario obtained by standard percolation, though, the case 
of slow convergence (a < 1) yields a reduction of the functional robustness. Abrupt truncation allows to consider 
examples that are equivalent to numerical simulations on computer-generated graphs and to compare them. In 
Fig. [BJ-A two groups of curves are reported, showing data of the numerical computation of the critical fraction f c 
for a power-law graph with different values of the cut-off k = 10 2 , 10 3 as a function of the exponent 7. Numerical 
simulations for power-law graphs with TV = 10 5 vertices (and 100 realizations) and cut-off k = 10 2 are reported in 
Fig. HJ1 (panels B-C). The curves in panel B refer to the case of 7 = 2.4 and a = 0.2, 1, they are in good agreement 
with the threshold values predicted by the theory. In panel C, the two curves refer to graphs with exponent 7 = 2.8 
and the same values of the parameter a. This time the actual collapse of the curves takes place at larger values of 
/ if compared with theoretical predictions. For a = 0.2 and 7 = 2.8 the threshold value should be close to / = 0.6, 
while numerical simulations give at least 0.7. 
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FIG. 6: A) The fraction f c of removed nodes required to destroy the giant component Qf as a function of the exponent 7 for 
a power-law graph when the transition probability function is single-vertex depending Tk ~ 1 — exp( — f^)- The parameter 
a controls the saturation to 1 for highly connected nodes. The figure reports the numerical computation of the expression 
f c = 1 — q c with q c from Ea. 1451 when, in the infinite graph, we put a cut-off on the degree. The cut-off values are re = 10 2 
(open symbols) and 10 3 (full symbols) and the control parameter is a = 0.2 (circles) and 1 (squares). The critical fraction is 
lower than 1 in all the range 2 < 7 < 3, but in agreement with the results for standard homogeneous percolation (dashed and 
dotted-dashed lines). 

B-C) The curves Gf(f) obtained by numerical simulations on power- law random graphs of size N = 10 and exponent 7 = 2.4 
(B) and 7 = 2.8 (C) for the two values of a = 0.2, 1 and cut-off re = 10 2 . The arrows indicate the positions of the threshold 
predicted using Eq. I4U1 



However, the estimation of the threshold value, as also shown in Fig.HJ is a very difficult task in power-law graphs, 
given that the shape of the curves decreases with constant convexity, converging very smoothly to zero. As we will see 
in the next example, simulations on homogeneous graphs provides considerably better results. A possible explaination 
for the occurrence of these biases comes from the fact that the probability of nodes with maximum degree (fc w k) 
is 0(1 /N), then in a finite graph their frequency depends on the size of the system. Compared to the homogeneous 
case, the traffic on heterogeneous graphs is clearly unbalanced in favor of large degree nodes, therefore we argue that 
finite size effects can be more relevant. Similar effects are visible also in the case of optimal transmission (T = 1 for 
all edges), albeit they are emphasized when the global efficiency of the network is decreased. Moreover, this argument 
agrees with the progressive steepening of the curves for larger sizes as shown in panel D of Fig. HJ 

Structural properties of homogeneous graphs, on the contrary, are always deeply affected by this type of transition 
probability, expecially when perfect transmission is reached beyond the peak of the degree distribution (a < 1). The 
threshold for the site percolation with inhomogeneous edges is a function of the average degree z — (fc), with the 
parameter a adjusting saturation's rapidity, 

? c ( a > z ) = — rZ 2 (c i p( _» ) _ 1) _ 2ail • (48) 

z[l - exp[ ( p( f ' - ]] 

We have plotted in Fig.|7J-A the corresponding curves for the critical fraction of removed nodes f c determining network's 
collapse. The results of numerical simulations on Erdos-Renyi random graph with TV = 10 5 vertices and (fc) = 10 are 
also shown in Fig. (panels B-C). The three curves in panel B represent the size of the giant component Qf as a 
function of / for different values of the parameter a — 0.2 (circle), a = 1 (squares) and a = 5 (triangles). The figure C 
shows the good agreement between the numerical values of the percolation threshold (points) obtained by simulations 
and the correspondent theoretical predictions (lines) for different values of the average degree (fc) = 5, 10, 20. 



Another wide class of transition probabilities contains those converging to zero with increasing degree. At a 
first glance, it seems a very unphysical condition, but the problem can be inverted asking which is the maximal 
decay in the degree dependence still ensuring site percolation. For instance, exploiting bond percolation to study 
epidemic spreading, Newman has shown j3fj] that if transmissibility decreases as 1/fc or faster, there are no global 
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FIG. 7: A) Fraction f c of removed nodes required to destroy the giant component as a function of the average degree (fc) for 
an Erdos-Renyi random graph with single-vertex transition probability Tj, ~ 1 — exp(— fix)- The curves correspond to different 
values for the control parameter a: a — 0.2 (circles), 1 (squares) and 5 (triangles). Highly connected random graphs are 
extremely robust, but many real networks have low average connectivity, range in which the robustness decreases considerably. 
B-C) Panel B displays the size of the giant component Qf as a function of the removed fraction / of nodes obtained by 
simulations on Erdos-Renyi random graphs of N = 10 5 nodes (averaged over 100 realizations), average degree (fc) = 10 and 
parameter a — 0.2 (circles), 1 (squares) and 5 (triangles). In panel C, numerical values (points, obtained by simulations) and 
theoretical predictions (lines) of the corresponding percolation threshold for Erdos-Renyi random graphs with different average 
degree (fc) = 5, 10, 20. 
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FIG. 8: A) Fraction f c of removed nodes required to destroy the giant component as a function of the exponent 7 for a 
power-law graph in which the transition probability is the single- vertex function 71- ~ fc~ Q , with exponent < a < 1. The 
curves are report the values of f c = 1 — q c with q c computed by Ea. 1491 All curves for a — 0.25 (circles), 0.5 (squares) and 
0.75 (triangles) show that the network's robustness is highly diminished compared to the homogeneous result (dashed line), 
also for an infinite graph. In particular, a finite threshold lower than 1 appears also in the range 2 < 7 < 3. 
B) Threshold value f c of removed nodes as a function of the parameter a for power-law graphs with cut-off k = 10 2 and exponent 
7 = 2.3 (full line and circles) and 7 = 2.6 (dashed line and crosses). The symbols (circles and crosses) are results from simu- 
lations on random graphs of size N = 10 5 (average over 100 realizations) ; the lines are the correspondent theoretical predictions. 
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outbreaks. Here, we can easily recover the same result expliciting the threshold value with the more appropriate tool 
of inhomogcneous site percolation. For single- vertex transition probability T^, Eg. 1411 implies that, independently of 
the degree distribution, a graph does not admit percolation if Tk decays as T\/k a with a > 1. Indeed, by substitution, 
q c = (k)/[T 1 ((k 2 - a ) - (fc 1 -"))], that is larger than 1 because (k)> {k 2 ~ a ) > (fc 1_Q ), and T x < 1 (assuming (k) > 1). 
This is actually an equivalent result of that presented in Ref . [3(| . 

Since a = corresponds to standard percolation, we are interested in the intermediate case < a < 1. In scale- free 
networks the percolation threshold is computed by series summation, 

to" 1 ) (49) 



HC C( 7 + a-2)-C(7 + a-l) ' 

that has been plotted in Fig. [SI A as a function of 7 for different values of a between and 1. The curves give evidence 
to the fact that a transition probability moderately decreasing with the degree can induce power-law graphs with 
2 < 7 < 3 to present a finite percolation threshold. However, many real networks (Internet, WWW, etc.) have a 
power-law degree distribution with exponent lower than 2.5, for which the giant component persists even for relatively 
large a (at least in the approximation of infinite systems). Nevertheless, the resilience is reduced in finite systems or 
in presence of a cut-off on the degree as shown by the results of simulations reported in Fig. [SJ-B. The figure shows 
the behaviour of the threshold value f c as a function of the parameter a for two different exponents 7 = 2.3, 2.6. The 
lines are the theoretical predictions for (infinite) systems with cut-off k — 10 2 , the points are numerical results on 
finite systems with N — 10 5 and same cut-off. As for the other results of simulations on power-law graphs that we 
have presented, the threshold values f c for finite sizes are always larger (i.e. q c is smaller) compared to the theoretical 
predictions for infinite graphs with the same cut-off on the degree. This seems to be a characteristic of power-law 
graphs and would require a separate accurate analysis for increasing sizes N of the graphs. 



This collection of examples is far from being complete, and it should rather represent a way to better understand 
potential applications of calculations and formulae presented in Section II I II In particular, among other possible 
expressions for the transition probability, we would like to mention non-monotonous or peaked degree-dependent 
transition probabilities, with an optimum of transmissibility for (not necessarily large) characteristic values of the 
degree. For a deeper understanding, let us consider the spreading of a virus on the Internet: highly connected 
nodes have large exchanges of data with their neighbours, for that they should be potentially the nodes with highest 
transmission capability. On the other hand, there is a common awareness of their importance, thus they are better 
protected and controlled, leading to a considerable reduction in the effective transition probability for the spreading 
of viruses from these nodes to their neighbours. Very low-degree nodes are less protected but also less exposed to 
the transmission (their data exchanges are limited) . A similar scenario suggests that also non-monotonous transition 
probabilities are actually interesting in the study of dynamical processes on networks. 



3. Uniform and random distributed transition probabilities 

In Section IIIII we have shown that the generalized Molloy-Reed criterion reduces to the results obtained with 
the intuitive arguments of Section ITT1 through successive assumptions. Indeed, as stressed in Ref. :3Q'j, uniform 
transition probability is far from being an unphysical condition: indeed, in uncorrelated random graphs, {T^} are 
well approximated by i.i.d. random variables and, in principle, the probability to flow along an edge (bringing physical 
quantities as information, energy, diseases, etc.) is the average value (T) = T over the {T!y } distribution. Hence, we 
expect that, in random graphs, the average behaviour of site percolation with non-optimal transmission on the edges 
(with random ) is well predicted by Eq. 021 where T = (T) is the average edge transition probability. 

This statement has been checked by simulating percolation process on an Erdds-Renyi random graph with (k) = 10 
and on a power-law random graph with exponent 7 = 2.3. Both graphs have N = 10 5 nodes and the transition 
probabilities are assigned randomly between and 1. Consistently with the data presented in the previous sections, 
Fig. [!5JA displays the behaviour of the network's robustness under random failures. The critical fraction of removed 
nodes is related to the site percolation's critical occupation probability by f c = 1 — q c . Panel A reports the size of the 
giant component of the Erdos-Renyi graph as function of the fraction / of removed vertices for (T) = 0.25, 0.5, 0.75, 1; 
the predicted values f c = 0.6,0.8,0.867,0.9 are well reproduced by simulations. The same measures for a power-law 
graph are shown in Fig. ^JB. The curves in the simulations arc in good agreement with the theoretical values of the 
percolation threshold f c ~ 0.613 (for (T) = 0.25), 0.80 (for (T) = 0.5), 0.871 (for (T) = 0.75), 0.9 (for (T) = 1.0). 
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FIG. 9: Size of the giant component Qf as function of the fraction / of removed vertices for (T) — 0.25, 0.5, 0.75, 1 for an Erdos- 
Renyi random graph (A) with N = 10 5 vertices and (k) — 10 and a power-law random graphs (B) with same size, exponent 
7 = 2.3 and cut-off k = 10 2 . All curves have been averaged over 100 realizations. The predicted values of f c = 0.6, 0.8, 0.867, 0.9 
are well verified in the Erdos-Renyi graph by the values of / at which Qf ~ 0(log N). The power- law graph gives slightly worse 
results, but still in agreement with the theoretical values f c — 0.613, 0.80, 0.871, 0.9 (that are indicated by arrows on the curves). 



As a final remark on what we have called "functional robustness", we should note that other damage strategies, 
e.g. intentional attacks by degree-driven node removal, can be implemented also in the context of inhomogcneous 
percolation. The study of intentional damage to complex networks has been the topic of many research efforts 
0i I23L loll |65|, leading to striking analytical and numerical results on the vulnerability of networks. It has been 
shown that scale-free networks are very resilient to random failures, but extremely vulnerable if the attack is brought 
following a targeted strategy. In particular, Cohen et. al. [13 have studied analytically the effects of the removal of 
the most connected vertices, showing that scale-free networks are fragile to degree-driven intentional damage. When 
transition probabilities are present, edges are more refractory to transmit physical flows, therefore scale- free networks 
fragility should be even more emphasized. In order to show this behaviour, we apply to the generalized Molloy-Reed 
criterion exposed in Eq. 1401 the semi- numerical method proposed in Ref. |64| . We restrict the analysis to two simple 
cases: intentional attacks to the hubs, and a systematic variation of the attack strategy following a law of preferential 
selection of the nodes. 



Attacks targeted to hit the hubs in networks where the spread of the damage is influenced by edge transition 
probabilities can, in principle, result in a different threshold behaviour. As a first example, we consider the case in 
which Tij = = Tj~. The intentional damage removes vertices with k > «(/), where the cut-off k(J) depends 
implicitely on the fraction of removed nodes 



B. Networks robustness to intentional degree-dependent attacks 



1. Attacking the hubs 



K 




(50) 



k=l 



the condition for the percolation threshold becomes 



«(/) 




(51) 



k=l 



k=l 



The presence of Xfc actually modifies the threshold's value f c obtained in the case of optimal transmission. According 
to different functional expressions of Eg. 15 II provides different behaviours for the critical fraction f c . All functions 
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FIG. 10: Fraction f c of removed nodes required to destroy the giant component as a function of the exponent 7 for a power-law 
graph summitted to intentional attack by removing highest connected nodes. Panel A reports (as full lines) the curves / c (7) 
when transition probability has the form Tk ~ k~ a , with exponent a = 0.2 (circles), 0.4 (squares), 0.6 (up triangles). Panel 
B reports the same set of curves for different values of uniform transition probability T = 0.2 (circles), 0.4 (squares), 0.6 (up 
triangles), 0.8 (down triangles). In both panels the case of perfect transmission that corresponds to the standard homogeneous 
case is plotted in dashed lines. 



Tfc that are monotonously increasing with fc give results consistent with standard percolation. On the other hand, for 
Tfc ~ k~ a with a > 1 percolation does not occur; therefore, we consider < a < 1, that is a transition probability 
slowly decreasing with fc. Eg. 1511 becomes 

K K 

fc=i fc=i 

that solved numerically for different values of a = 0.2, 0.4, 0.6 gives the curves displayed in Fig.llOtA . Their qualitative 
behaviour is the same as for constant transmissibility. 

Suppose now that transition probabilities Xy do not depend on the degree, but are distributed randomly, then 
the effective transmissibility is well-approximated by its average value (T) (see Section TlV A 3|l . Alternatively, one 
can suppose a uniform transition probability T = (T) . Introducing it into Eq. 1511 together with the expression of a 
power-law degree distribution pk = C(l) ^ 1 ■> we nn d the relation 

E^-fcr^+Efc 1 -^ (53) 
fc=i ^ * fe=i 

that provides ^(7, (T)). Then, f c (l, (T)) follows from Eq. 1201 The equation can be solved numerically, leading to a 
family of curves for {/ c (7)}^) that are displayed in Fig. I10I B. The computation is an extension of previous results 
[64| . to which our curves tend for (T) — * 1. It is interesting to note that, in both the cases, decreasing (T) implies a 
decrease in f c , causing the network to be more and more fragile to intentional degree-driven attacks. 

According to our computations, scale-free networks seem to be "pathologically" fragile with respect to degree- 
dependent removing of nodes, and transition probabilities actually play only a marginal role in determining network's 
vulnerability under intentional degree-driven attack strategies. 



2. Systematic variation of attack strategies 

Another type of resilience analysis is provided by a numerical study by Gallos et al. |6^] on the tolerance of scale- 
free networks to degree-driven attacks with variable strategies from friendly to intentional attacks. A friendly attack 
consists of the removal of preferentially low-degree nodes, while intentionally attacks are aimed to hit the hubs. The 
random failure is the intermediate case between these two extremes. In Ref. [65| . the systematic variation of attack 
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strategy is encoded into a probability W(k) of choosing a node to be destroyed with a certain fixed probability, that 
is modeled as a preferential rule 

with — oo < ft < +00. The authors shows numerically that for ft > the effects of the attacks are progressively more 
destructive as ft is increased, while for ft < the fraction of removed nodes / necessary to destroy the network is very 
close to 1. 

In our context, preferential attack strategies can be associated to a bond percolation process with power-law 
occupation (or traversing) probability qt ■ In terms of percolation, indeed, the fraction / of removed nodes is written 

K 

f = 1 - ^PkQk , (55) 
fe=i 

with gfc oc 1 — W(k); then the cut-off «(/) is a function of / as in Eq. [501 The correct form of the generalized 
Molloy-Reed criterion comes directly from Eq. 0U]with T\. — 1 Vfc, and states that a giant component exists when 

«(/) 

MO ~ 1)9* ~ l]Pfc > • (56) 

k—1 

Requiring the equality, it is possible to compute numerically the threshold value f c — / c (7, ft) of removed nodes. Very 
recently, Gallos et al. ( 66] ) have obtained an approximated expression for the percolation condition in the case the 
attacks are sequentially brought. They have evaluated the probability p(k) = (1 — W(k)) that a node of degree k 
is still present after d node removals. However, if we assume that the attacks are simultaneously launched, the only 
important quantity is the fraction / of removed nodes, while the survival probability is p(k) = (1 — W(k)) = qk 
(d = 1). In this limit the previously exposed approach is completely equivalent to that in Ref. |66j . An interesting 
improvement of this method should be the study of networks resilience in the case of inhomogeneous edges (T^ =/= 1). 



V. CONCLUSIONS 



Percolation theory provides powerful methods to study statistical properties of physical systems and, recently, 
it has been exploited to uncover structural features of real complex networks such as the Internet. Nevertheless, 
standard homogeneous percolation fails in pinpointing the interplay between the topology and the properties of the 
flows and other dynamical phenomena taking place on it. Typical examples are spreading phenomena: frequently 
modeled by (homogeneous) percolation, the real processes possess features that cannot be encoded in its formalism 
because spreading rates are non homogeneous and depend on nodes and links characteristics. To model more realistic 
processes, in which the flow of information, energy, or any other physical quantity depend on the transmission ability 
of nodes and edges, we have introduced inhomogeneous edge transition probability and node occupation probability 
and studied the corresponding joint site-bond percolation process. The influence of inhomogcncity is reflected not 
only in the structural properties but also in the network's functionality and efficiency. 

Inhomogeneous percolation has been analytically studied on random networks with generating functions techniques, 
obtaining explicit expressions for the site and bond percolation thresholds as a function of both the topology and the 
transition probability. The main result is the generalization of the well-known Molloy-Reed criterion for the existence 
of a giant component (i.e. of percolation) for a Markovian correlated random graph with arbitrary degree distribution. 
The analytical study considers also the very general case of multi-state nodes, that seems to be interesting for social 
networks, where the existence of community structures is well-established. Simpler situations regarding correlated and 
uncorrelated random graphs composed by identical nodes have been also studied. In particular, explicit expressions for 
the critical threshold of the site percolation with inhomogeneous edges have been computed in the case of uncorrelated 
random graphs. 

In addition, percolation properties have been used to study the vulnerability of uncorrelated random graphs to 
random failures revealing that their functional robustness can be seriously reduced if the disparity of edge transition 
probabilities are taken into account in computing the effective giant component. Analytical results have been checked 
by numerically simulating percolation on homogeneous and heterogeneous uncorrelated random networks. 
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Many real weighted networks such as infrastructure or technological networks, present a relation between the weights 
and the actual flow of physical quantities (information, energy, goods, etc.) carried by the links. For this reason, they 
seem to represent a reasonable application's field for inhomogeneous percolation. Furthermore, a better knowledge 
of real data about Internet's traffic, diseases spreading and other flows on networks could provide major insights on 
what type of transition probability functions are actually relevant. 

In conclusion, beside the purely theoretical interest for a generalization of percolation on random graphs and 
its analytical solution, this paper provides a method to incorporate into the structural properties some aspects of 
the functional description of networks. Therefore, the implications of this new approach go beyond the details of 
the present study, and interest directly the recent and very stimulating issue of the interaction between network's 
topology and dynamical phenomena occurring on it. 
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APPENDIX A: GENERATING FUNCTIONS IN PERCOLATION PROBLEMS 



In order to better specify the notations used in the main text, we recall some basic notions about generating 
functions, considering only the case of infinite graphs without isolated vertices. The generating function for the 
degree distribution p k of a randomly chosen vertex is 



G (x)=]Tp fe x fc , (Al) 



fe=i 

with Go(l) = 1, and (k) — ^2 k kp k — G' (l). Similarly, the generating function for the probability that a randomly 
chosen edge leads to a vertex of given degree is 

Efc k Pk% h 1 _ r M _ G' (x) 

A useful property is that the probability distribution and its moments can be computed by simple derivative of the 
corresponding generating function. 

If we call % the probability that a vertex of degree k is occupied (or node traversing probability if regarded as a 
spreading phenomena), the probability that, choosing randomly a vertex, we pick up an occupied vertex of degree k 
is the product of the probabilities of two independent events, i.e. p k qk- Repeating the same operation with the edges, 
we need the probability that the randomly chosen edge is attached to an occupied vertex of degree k. This event 
happens with probability kp k qk/(k}- Hence, we define the generating functions for both these probabilities that are 
very important in the site percolation, 

oo 

F (x; {q}) = J2Pk1kX k , (A3a) 
fc=i 

The function F (x;{q}) is the generating function of the probability that a vertex of a given degree exists and is 
occupied, while Fi(x; {q}) is the generating function for the probability of reaching a vertex of a given degree starting 
by a randomly chosen edge and that it is occupied. 

The solution of the site percolation problem is the set of values {q k } for which an infinite cluster (giant component) 
exists. The same analysis holds if we collect the nodes on the basis of other properties, but in the following only the 
degree-dependent formalism will be developed. 

In order to compute the probability that a randomly chosen vertex belongs to the giant component, we start by 
computing the probability P s that a randomly chosen vertex belongs to a connected cluster of a certain size s. The 
use of generating functions allows to do it simultaneously for all the possible sizes. Then, the mean cluster size is 
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obtained as the first derivative of the generating function of P s . Finally, the condition for the divergence of the mean 
cluster size gives the condition for the existence of the giant component as a function of the parameters of the system, 
that are the degree distribution and the node occupation probability. 

Firstly, we consider the probability P s that a randomly chosen vertex in the network belongs to a connected cluster 
of a certain size s. We call Ho(x] {q}) its generating function, 



H (x;{q})=J2P s 



(A4) 



s=0 



in which we have conventionally grouped in the term for s = the probability 1 — qkPk that a vertex is not 
occupied. Similarly, let P s be the probability that a randomly chosen edge leads to a cluster of a given size s, and 
Hi(x; {q}) its generating function. 

Since we choose the starting vertex at random, each possible degree k gives a different contribution to each possible 
cluster size probability P s , meaning that each term P s x s is itself given by an infinite sum of terms labeled by the 
degree k. For instance, the first terms of Hq(x; {q}) are: 



Po 

Pi a; 
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(A5b) 
(A5c) 
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1=1 



1 -i 



;>3 



Now, summing these terms and grouping similar contributions we get 



Ho(x;{q}) = 1 - ^Pkqk +pmx 



k 

+ P2q 2 x [P 2 + 2P P lX + (Px) 2 + 2P P 2 x 2 



P a +P!X + P2X 2 



k 



PkQk +p\q\x 



P0 + P1X + P 2 x 2 



- 2P lX P2X 2 + {P 2 x 2 ) + . 
p 2 q2X Po + P\x + P 2 x 2 



1 - ^Pkqk + xp 1 q 1 H 1 (x\ {q}) + xp 2 q 2 [H 1 (x; {q})]" 



(A6) 



Note that no term contains po9o according with the convention of considering only non isolated nodes. A compact 
form for this expression is written using Eq. IA3al 

P (z; {<?}) = 1 - Po(l; M) + xFoiH^x; {q}): {q}) . (A7) 

The structure of Eq. IA7I can be represented diagrammatically as shown in Fig. 1111 associating the variable x to each 
"bare" vertex and a variable H(x; {q}) to each "dressed" vertex, while the function Po weights the contributions over 
all possible degrees. 

Moreover, the generating function Hi(x; {q}) satisfies a similar self-consistent equation, 



Ht (x; {q}) = 1 - Pi (1; {q}) + xF 1 (H, (x; {q}); {q}) 



(A8) 



that is obtained following completely similar arguments starting from picking up an edge at random. 

The mean cluster size (s) is computed directly from Eq. IA7I as the first derivative of Hq with respect to x in 
x = 1. Imposing its divergence allows to find the condition for the existence of a giant component, that corresponds 
to the Molloy-Reed criterion as presented in Ref. 0] ■ Finally, considering a uniform occupation probability qu = q (or 
uniform node traversing probability), this criterion gives the expression for the site percolation threshold q c . 
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FIG. 11: A) A full dotted bullet with dashed contour line corresponds to the probability that a vertex is unoccupied. This is 
given by a particular series of diagrams, in which we sum the contributions of unoccupied vertices of all possible degrees. A 
striped bullet with full contour line represents the generating function Fq{x; {q}), whose diagrammatical expansion contains 
all possible combinations of occupied vertices reachable by an occupied vertex with a certain degree. The x accounts for the 
occupation of a vertex. The contributions of the isolated vertices {polo) have been deleted in agreement with our convention of 
considering only graphs with non isolated vertices. B) Diagrammatical representation of the generating function Ho(x). The 
first terms of the infinite series correspond to the summation of the Eas. IA5l as presented in Ea. IA6I These contributions can 
be expressed in a compact form using Eg. IA7I that contains the generating functions Fo(x; {q}) and H\(x). 



APPENDIX B: INHOMOGENEOUS PERCOLATION IN GRAPHS WITH MULTI-STATE NODES 

Consider a Markovian correlated graph with multi-state vertices, i.e. the vertices are divided into n different classes 
or states, and suppose that each vertex is given an occupation probability depending on its degree and each edge is 
endowed with a transition probability depending on the degrees and the states of the extremities. The fundamental 
brick for the construction of generating functions in correlated graphs is the rooted edge composed of a starting vertex 
i and the pending edge connecting it to a second vertex j, without explicitly considering this second extremity. 
For this reason we will always average on the degree of the second extremity of the edge. Let us consider a vertex 
i chosen at random, it will be characterized by a class h and by a degree ki. In principle, the ki edges departing 
from that node are connected to ki other nodes belonging to different classes. Actually, only njj of them are really 
reached by a flow because of the presence of transition probabilities on the edges (we call such transmitting edges 
open). Therefore, they identify a partition of {mf'\rnf'\ . . . ,to|™'*}, with Y], mf = rrii < ki, in which mp is the 
number of these neighbouring nodes belonging to the class I and linked to i by an open edge. 

Suppose that m!p of the ki edges emerging from a node of class h and degree ki are successfully connected to 
nodes of a same class I and (possibly different) degrees kj. The average probability that an edge among them allows 
the flow to pass is J2k- ^fc'fc^'' 'p*-' 1 ^ {kj \ki), where p( h ^ {kj \ki) is the degree conditional probability between vertices 

of states h and I and T^^*^ is the transition probability along an edge from a node of degree ki in the class h to 
a node of degree kj in the class I. The origin of this term is trivial: the probability to pass along the edge is the 
product of two independent events, i.e the edge exists and it is open; then, being interested in rooted edges, we 
have to average over all possible degrees kj. The probability that there are rnp of these edges produces a term 

Efe, ^fc fc P {kj\ki)\ . Positive events give n contributions of this kind, while the ki — rrii negative events 

contribute to a single term 1 — J2?=i J2kj ^fc ( fe- P (kji > that is the probability that ki — rrii edges do 

not admit the flow's passage whichever class they belong to. Computing the probability of the whole event associated 
with the partition {mf^ — ki — n%i,m\ ,m^\ . . . ,m\ } of the neighbours of the node with degree ki, we get the 
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multinomial distribution 

pW (fc . ){m W }) = fc ., i 
m) '< 



3l=l kj. 



k i kj l V 



[kj l \ ki) 



n 1 

n4 



CO, 



ki, 



(Bl) 



A simpler version of this multinomial distribution appears in Ref. [12 . 

The following step consists in using the expression of the multinomial distribution to obtain the generating function 
of the probability that a physical quantity spreading from a vertex of class h successfully flows through {m^} of 
its edges that point to vertices in the class {1} (1 = 1,2,... ,n). Summing over all possible values of ki and over all 
possible partitions of ki in n + 1 values {m^}, we obtain the generating function 



F^ h \xi,x 2 ,...,x n ;{q,T}) = 



ki=l 



""""" ^ S(h, > ,„ 

{mf } 



1=0 



(0 



)P {h \h,{m ( i l) })l[xf 



(B2) 



1=1 



in which is the occupation (traversing) probability of a vertex belonging to the class h with degree ki, S(-,-) 
is a Kronecker's symbol and the x\, . . . ,x n variables represent the average contributions of the rooted edges of the 
different classes. Introducing Ea. IB II in Ea. IB2I the sum over the partitions {ml } corresponds to the extended form 



of a multinomial term, providing the following expression for the generating function 



F^ ( Xl ,x 2 ,...,x n , {q, T}) = (x; {q, T}) = £ p£>gW 



i=i 



k, 



(B3) 



With a completely similar argument, we can compute the generating function F± (~x; {q, T}) of the probability 
that a randomly chosen edge leads to a vertex of class h from which the spread toward its neighbours successfully 
flows through {to'} edges pointing to nodes of class {1} (1 = 1,2,... ,n). Hence, observing that now the number of 
emerging edges available to the spreading process reduces to ki — 1 and that the probability to reach the starting 

vertex (from an edge pointing to a generic vertex of class h) is 



CO ' 



the generating function F^ h \x.; {q,T}) reads 



F[ h) (x 1 ,x 2 ,...,x n ;{q,T})= F± n> (x; {q, T}) 



OO 



(h) 

( h ) 



i=i 



(B4) 

As recalled in the previous subsection, the two generating functions are useful in the computation of a system of self- 
consistent equations (similar to those in Eas. lA7IA8|) from which the expression of the average cluster size (s) should be 

derived. The main difference concerns the form of the generating functions Fq^x; {q, T}) and F^' (x; {q, T}), that are 
partitioned in classes (of nodes in different states) and contain the contributions of the transition probabilities. Firstly, 
we consider the probability Ps that a randomly chosen edge leads to a vertex of class h belonging to a connected 
component of a given size s. Its generating function H± (x; {q, T}) = J2 S Ps h ^x s satisfies the self-consistent equation 



H[ h) (x; {q, T}) = 1 — ff> (1; {q, T}) + x f[ K) [H^ (x; {q, T}), (x; {q, T}); {q, T}} , (B5) 

where the presence of H[ (x; {q, T}) for all h = 1, . . . , n on the r.h.s. means that the constraint on the value of h is 
required only on the starting node, not on the others reachable from it. Moreover, x refers to the cluster distribution 
and does not need any label. 

The first term in the r.h.s. of Eci. lB5l is due to the probability that the node of class h to which a chosen edge leads 
is not occupied, therefore it should not depend on the transmissibility of any outgoing edge. As required, the term 
1 - F^ h) (l; {q,T}) computed in x = 1 does not depend on {T}. This corresponds exactly to the term Pq in the 
cluster expansion. The second term of Eq. IB 51 refers to the contribution of an occupied vertex. Let us suppose that 
its degree is ki and consider one of its outgoing edges leading to a vertex in the class /: its contribution is given by the 
probability 1 — J2k- T^k*^ P^ h ~* 1 ^ (^ji that the flow does not reach the second extremity of the edge, and by the 
probability that it passes (%2 k p( h ^ (kj\ki)). The latter has to be multiplied by the vertex function associated 
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to the probability that this second vertex (of class I) belongs to a cluster of a given size. This probability is generated 
by the function H[ l \x; {g, T}) that is the correct vertex term for this contribution. Then, all these quantities have 

to be averaged over the set of degrees fej, with weights pi/ and Qu9 , easily recovering the second term on the r.h.s.. 
Since the spirit of the derivation is completely the same, we refer again to Fig. llll for a diagrammatical representation 
of the Eg. IB5I f details are different). 

The other equation, for the generating function Hq (x; {q, T}) of the probability that a randomly chosen vertex of 
class h belongs to a cluster of fixed size s, reads 



(x; {q, T}) = 1 - F W (1; {q, T}) + x F W [H^ (x; {q, T}) , . . . , H^> (x; {q, T}); {q, T}} 



(B6) 



Note that, by definition, both Hq H \x; {q,T}) and (x; {q,T}) are 1 in x = 1 for all h. 

Now, taking the derivative of Eci. lB6l with respect to x in x = 1, we obtain the average number of vertices reachable 
starting from a vertex in the class h, 



(h) 



dF, 



CO 



dxi 



Hi l) \l;{q,T}). 



(B7) 



The second term in the r.h.s. contains linear contributions from other classes of vertices, therefore Eg. IB7I can be 
written in a matrix form (in the n x n product space generated by pairs of classes) as 



(s)=V x n (x;{q,T}) 



F [l;{g,T}] + V x F [x; {q,T}\ 



x=l 



V^HiOr^T}) 



x=l 



(B8) 



x=l 



with s = (si, S2, . . • , s n ). Taking the derivative of Eo. lB5l with respect to x in x = 1, we obtain an implicit expression 
forffW'(l;{«,T}) 



d 



H[ hy (l; {q, T}) = [1; {q, T}} + ^ ^-F™ [1; {g, T}]h[ 1) ' ' (1; {q, T}) , 



and putting together the contributions in a matrix formulation, we get 



V.Hi^feT}) 
Explicitly, Eq. IB 101 becomes 



F 1 [l;{q,T}] + V at F 1 [x ] {q,T}] 



V x H x (a:;{ g ,T}) 



V.Hx^^T}) 



= [/ - V x F x [x = 1; {q, TjT 1 ■ Fi[l; {g,T}] = [/ - T]' 1 ■ F^l; {<?, T}] 



with = V x Fi[x = 1; {g,T}]. Introducing the expression in Ea. IB8I wc obtain 

(s) = F [l; {q, T}] + V x F [x = 1; {q, T}] ■ [I - T]' 1 ■ F x [1; {q, T}] 



(B9) 



(BIO) 



(Bll) 



(B12) 



The condition for a giant component to emerge consists in the divergence of at least one of the components of (s), 
that corresponds to the vanishing of the determinant of the matrix [I — J-]. 
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